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DETAILED ACTION 
Response to Amendment 

1 . In response to the office action from 10/7/2004, the applicant has submitted an 
amendment, filed 1 1/3/2004, amending Claims 1, 10, and 13, while arguing to traverse the art 
rejection based on the limitations of amended claims 1 and 13 as per the reasons discussed in the 
teleconference from 10/13/2004 directed towards detecting pauses between subgroups, wherein a 
subgroup is a single word or password rather than words within a sentence (Amendment, Page 
7). The applicant's arguments have been fully considered, but are moot in view of the new 
grounds of rejection in view of Gerson et al (U.S. Patent: 4,870,686), 

Claim Rejections - 35 USC §102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of apphcation for patent in the United States. 

3. Claims 1-5, 10-15, 18, 20-21, and 25-27 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Gerson et al (U.S. Patent: 4,870,686). 

With respect to Claims 1 and 13, Gerson discloses: 
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Receiving at least a current subgroup of speech units that form part of a complete speech 
sequence that is to be input from a user, the complete speech sequence being embodied as at least 
one of a word and a password comprised of a plurality of alphanumeric characters, the subgroup 
being one or more alphanumeric characters of the complete speech sequence (partial digit 
sequence of a complete keyword, Col 7, Lines 16-25 and abstract) ; 

Detecting a natural pause between input subgroups such that a pause between two 
alphanumeric characters in a given subgroup or a pause between one alphanumeric character and 
a subgroup are detected (pause detection, Col. 7, Lines 16-68); 

Recognizing the speech units of the subgroup to provide a recognition result (recognizing 
speech and displaying and synthesizing the result after a pause, Col. 7, Lines 38-56); and 

Immediately feeding back the recognition result for verification by the user (display and 
synthesis of a recognition result after a detected pause, Col. 7, Lines 27-56). 

With respect to Claims 2 and 14, Gerson discloses: 

A speech recognition method and system, wherein said user is only prompted to repeat 
said subgroup for re-recognition and re-verification if a rejection criteria is met (repeat prompt if 
no keyword is recognized, Col 7, Lines 16-25, and a confirmation to indicate that an input has 
been cleared and further speech can be input upon the utterance of a "clear" command, Col. 7, 
Lines 27-56). 

With respect to Claims 3 and 20, Gerson recites: 

Repeating the steps of Claim 1 for remaining input subgroups until it is determined that 
the complete speech sequence has been recognized (repeating the recognition of partial digit 
strings until the utterance of a terminate command, Col. 7, Lines 56-68). 
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With respect to Claims 4 and 21, Gerson recites: 

A speech recognition method and system, wherein the last step of Claim 1 is affected 
using pre-recorded prompts or via text-to-speech synthesis, (TTS) to feedback the recognition 
result (synthesized recognition results, Col 7, Lines 27-68). 

With respect to Claims 5 and 18, Gerson discloses: 

The rejection criterion is embodied as a negative utterance spoken by the user after 
receiving the fed back recognition result ("clear" command word that negates an undesired 
recognition result, Col. 7, Lines 27-56). 

With respect to Claims 10 and 25, Gerson discloses: 

The speech units are selected from any of spoken digits and spoken letters (spoken digits, 
Col 7; Lines 16-25). 

With respect to Claims 11 and 26, Gerson recites: 

A speech recognition method and system, wherein input of a next subgroup after 
receiving the fed back recognition result indicates a correct recognition of the currently input 
subgroup (Col. 7, Lines 27-56). 

With respect to Claims 12 and 27, Gerson recites: 

The speech recognition unit determines a confidence level for the recognition result 
(distance metric for determining a most likely template match, Col 4, Lines 38-51). 
With respect to Claim 15, Gerson discloses: 

A speech recognition system, wherein the speech recognition unit compares the input 
subgroup with stored recognition grammar in order to determine the recognition result (speech 
recognizer having a digit template, Col 4, Lines 38, and Col 7, Lines 16-68). 
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Claim Rejections - 35 USC §103 

4. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

5. Claims 6, 17, and 19 are rejected under 35 U.S.C 103(a) as being unpatentable over 
Gerson et al in view of Hou et al (U.S. Patent: 5,325,421). 

With respect to Claims 6, 17, and 19, Gerson teaches the speech recognition system 
capable of detecting pauses between digit segments and prompting a user to repeat a subgroup if 
a digit is not recognized, as applied to Claim 2. Gerson does not teach the ability to include a 
negating speech input in a spoken digit string, however, Hou discloses such a means (cancel 
command immediately following a digit sequence, Col. 10, Lines 42-58). 

Gerson and Hou are analogous art because they are from a similar field of endeavor in 
speech-controlled interfaces. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Gerson with the ability to include a 
negating speech input in a spoken digit string as taught by Hou in order to provide a means for a 
user to delete an incorrect speech input while entering a digit string (Hou, Col 10, Lines 47-51), 
thus implementing a more efficient digit recognition processing by bypassing the partial digit 
string verification step as taught by Gerson and applied to Claim 1 . 
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6. Claims 7-8 and 22-23 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Gerson et al 

With respect to Claims 7 and 22, Gerson teaches the speech recognition method and 
system capable of recognizing digit segments through pause detection means to enable, upon 
input of a "clear" command, correction of input and recognition errors, as applied to Claims 2 
and 14. Although Gerson does not specifically suggest prompting a user to input shorter speech 
segments upon repeated recognition errors, the examiner takes official notice that it would have 
been obvious to one of ordinary skill in the art, at the time of invention, to prompt the user to 
input shorter speech segments if repeated recognition errors occur, because shorter utterances 
have less complex speech models and thus, logically, would provide a higher level of recognition 
accuracy. Therefore, prompting a user to input easily recognized, shorter speech segments 
would provide a well-known means of increasing recognition accuracy. 

With respect to Claims 8 and 23, Gerson teaches the speech recognition method and 
system capable of recognizing digit segments through pause detection means to enable, upon 
input of a "clear" command, correction of input and recognition errors, as applied to Claims 2 
and 14. Although Gerson does not specifically suggest prompting a user to input shorter speech 
segments upon repeated recognition errors, the examiner takes official notice that it would have 
been obvious to one of ordinary skill in the art, at the time of invention, that by speaking shorter 
and more easily recognized speech segments, a user would gradually learn the proper way to 
input recognizable utterances. For instance, a user may speak a string of digits too quickly to be 
recognized correctly. By speaking each speech segment individually, the speaker would be able 
to attempt a single utterance segment multiple times and gradually come to understand the proper 
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method of producing recognizable speech. Therefore, prompting a user to speak smaller speech 
segments acts a means of training that user to properly input an utterance. 

7. Claims 9 and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable over Gerson 
et al in view of Larsen ("Investigating a Mixed-Initiative Dialogue Management Strategy, " 
1997). 

With respect to Claims 9 and 24, Gerson teaches the speech recognition system capable 
of detecting pauses between digit segments and prompting a user to repeat a subgroup if a digit is 
not recognized, as applied to Claim 2. Gerson does not teach the ability to enter speech units 
using a dial pad upon repeated recognition errors, however Larsen discloses: 

A speech recognition method and system, wherein if said rejection criteria are met 
repeatedly, the user is prompted to use a dial pad to enter the speech units (ability to switch to 
DTMF input mode upon repeated recognition errors, Page 66-67, Application). 

Gerson and Larsen are analogous art because they are from a similar field of endeavor in 
speech-controlled interfaces. Thus, it would have been obvious to a person of ordinary skill in 
the art, at the time of invention, to modify the teachings of Gerson with the ability to enter 
speech units in a DTMF input mode upon repeated recognition errors as taught by Larsen to offer 
an alternative means of inputting information in a speech interface if a user becomes frustrated 
with repeated recognition errors. 

8. Claim 16 is rejected under 35 U.S.C. 103(a) as being unpatentable over Gerson et al in 
view of Ladd et al (U.S. Patent: 6,269,336). 
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With respect to Claim 16, Gerson teaches the speech recognition system capable of 
detecting pauses between digit segments and featuring a digit grammar for recognition, as 
applied to Claim 15. Gerson does not suggest that the recognition grammar is stored in a remote 
memory accessible by the speech recognition unit, however Ladd teaches such a configuration 
(grammar database at a server, Col. 8, Lines 55-61). 

Gerson and Larsen are analogous art because they are from a similar field of endeavor in 
speech recognition. Thus, it would have been obvious to a person of ordinary skill in the art, at 
the time of invention, to modify the teachings of Gerson with a recognition grammar stored at a 
remote server database as taught by Ladd in order to provide a well-known means of conserving 
system memory in a device with limited storage. 

Conclusion 

9. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 

McMahan et al (U.S. Patent: 5, 71 7, 743)- teaches speech recognition for a digit password 

string. 

Barclay et al (US. Patent: 5,960,399)- teaches speech recognition implemented over a 
network. 

10. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (703) 305-8669 
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and email is James.Wozniak@uspto.gov. The examiner can normally be reached on Mondays- 
Fridays, 8:30-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Doris To can be reached at (703) 305-4827. The fax/phone number for the 
Technology Center 2600 where this application is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the technology center receptionist whose telephone number is (703) 306- 



0377. 



James S. Wozniak 
1/19/2005 




